Networked Windows NT System Field Failure Data Analysis
Abstract
This paper presents a measurement-based dependability study of a networked Windows NT system, based on field data collected from NT system logs on 503 servers running in a production environment over a four-month period. The event logs at hand contain only system reboot information. We study individual server failures and domain behavior in order to characterize failure behavior and explore error propagation between servers. The key observations from this study are: (1) system software and hardware failures are the two major contributors to the total system downtime (22% and 10%, respectively), (2) recovery from application software failures is usually quick, (3) in many cases, more than one reboot is required to recover from a failure, (4) the average availability of an individual server is over 99%, (5) there is a strong indication of error dependency or error propagation across the network, (6) most (58%) reboots are unclassified, indicating the need for better logging techniques, and (7) maintenance and configuration contribute 24% of system downtime.
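As a rough illustration of how a per-server availability figure like the one in observation (4) can be derived from reboot-only logs, the sketch below computes availability as the fraction of an observation window not spent in outages. It is not the paper's method: the function name `availability`, the `(down_at, up_at)` outage representation, and all dates and durations are hypothetical values chosen for the example.

```python
from datetime import datetime, timedelta


def availability(window_start, window_end, outages):
    """Return the fraction of the window during which the server was up.

    outages is a list of (down_at, up_at) datetime pairs, assumed not to
    overlap one another. Outages are clipped to the observation window.
    """
    total = (window_end - window_start).total_seconds()
    if total <= 0:
        raise ValueError("observation window must have positive length")
    down = 0.0
    for down_at, up_at in outages:
        start = max(down_at, window_start)
        end = min(up_at, window_end)
        if end > start:
            down += (end - start).total_seconds()
    return 1.0 - down / total


# Hypothetical data: a 120-day window with two outages (2 h and 30 min).
start = datetime(2000, 1, 1)
end = start + timedelta(days=120)
outages = [
    (start + timedelta(days=10), start + timedelta(days=10, hours=2)),
    (start + timedelta(days=60), start + timedelta(days=60, minutes=30)),
]
example = availability(start, end, outages)
print(f"availability = {example:.4%}")
```

With 2.5 hours of downtime in a 2880-hour window, the example server is available for well over 99% of the time, which shows how even several reboots can leave the long-run average high.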
Related resources
Measurement-Based Analysis of System Dependability Using Fault Injection and Field Failure Data
The discussion in this paper focuses on the issues involved in analyzing the availability of networked systems using fault injection and the failure data collected by the logging mechanisms built into the system. In particular we address: (1) analysis in the prototype phase using physical fault injection into an actual system. We use an example of fault injection-based evaluation of a software-imple...
OFTT: A Fault Tolerance Middleware Toolkit for Process Monitoring and Control Windows NT Applications
This paper describes OFTT (OLE Fault Tolerance Technology), a fault tolerance middleware toolkit running on the Microsoft Windows NT operating system that provides the required fault tolerance for networked PCs in the context of industrial process monitoring and control applications. It is based on the Microsoft Component Object Model (COM) and consists of components that perform checkpoint-sa...
Delivery of High Quality Uncompressed Video over ATM to Windows NT Desktop
The emergence of high-bandwidth applications such as medical visualization and virtual reality has exposed significant deficiencies in network, protocol, and end-system design. In this paper we discuss important end-system issues which arise when supporting applications demanding networked delivery and manipulation of uncompressed video to the desktop. Our experimental network environment consis...
VPARK - A Windows NT Software Platform for a Virtual Networked Amusement Park
In this paper we present the Virtual Park (or VPARK) system. This includes a Networked Virtual Environment (NVE) System, called W-VLNET and an Attraction Building System, able to create and modify attractions used in the NVES. Both systems have been developed in the Windows NT environment. The paper outlines the techniques for communication, scene management, facial and body animation, and gene...
Publication date: 1999